Skip to Main Content
Coursera
  • Browse
  • Top Courses
  • Profile
  • My Purchases
  • Get 50% Off
  • Settings
  • Updates
  • Accomplishments
  • Help Center
    Coursera
    1. Structuring Machine Learning Projects
    2. Week 2
    3. Autonomous Driving (Case Study)
    PreviousNext
      • Completed
        Video: LectureCarrying Out Error Analysis
        . Duration: 10 minutes10 min
      • Completed
        Video: LectureCleaning Up Incorrectly Labeled Data
        . Duration: 13 minutes13 min
      • Completed
        Video: LectureBuild your First System Quickly, then Iterate
        . Duration: 6 minutes6 min
      • Completed
        Video: LectureTraining and Testing on Different Distributions
        . Duration: 10 minutes10 min
      • Completed
        Video: LectureBias and Variance with Mismatched Data Distributions
        . Duration: 18 minutes18 min
      • Completed
        Video: LectureAddressing Data Mismatch
        . Duration: 10 minutes10 min
      • Completed
        Video: LectureTransfer Learning
        . Duration: 11 minutes11 min
      • Completed
        Video: LectureMulti-task Learning
        . Duration: 12 minutes12 min
      • Completed
        Video: LectureWhat is End-to-end Deep Learning?
        . Duration: 11 minutes11 min
      • Completed
        Video: LectureWhether to use End-to-end Deep Learning
        . Duration: 10 minutes10 min
      • Completed
        Quiz: Autonomous Driving (Case Study)
        15 questions
      • Video
        Video: LectureRuslan Salakhutdinov Interview
        . Duration: 17 minutes17 min
      • Reading
        Reading: Acknowledgments
        . Duration: 10 minutes10 min
    QuizQuiz • 45 min45 minutes

    Autonomous Driving (Case Study)

    Activity Completed
    Submit your assignment
    Due DateJun 14, 2:59 AM EDTJune 14, 2:59 AM EDT
    Attempts3 every 8 hours
    Retake the quiz in 7h 55m
    Activity Completed
    Receive grade
    To Pass80% or higher
    Grade
    100%
    We keep your highest score

    Confirm Navigation

    Are you sure you want to leave this page?
      
    ˆ
    Autonomous Driving (Case Study)
    Graded Quiz • 45 min

    Due Jun 14, 2:59 AM EDT

    Check
    Congratulations! You passed!
    To Pass 80% or higher
    Retake the assignment in 7h 55m
    Grade
    100%

    Autonomous Driving (Case Study)

    Latest Submission Grade
    100%
    1.
    Question 1

    To help you practice strategies for machine learning, in this week we’ll present another scenario and ask how you would act. We think this “simulator” of working in a machine learning project will give a task of what leading a machine learning project could be like!

    You are employed by a startup building self-driving cars. You are in charge of detecting road signs (stop sign, pedestrian crossing sign, construction ahead sign) and traffic signals (red and green lights) in images. The goal is to recognize which of these objects appear in each image. As an example, the above image contains a pedestrian crossing sign and red traffic lights

    Your 100,000 labeled images are taken using the front-facing camera of your car. This is also the distribution of data you care most about doing well on. You think you might be able to get a much larger dataset off the internet, that could be helpful for training even if the distribution of internet data is not the same.

    You are just getting started on this project. What is the first thing you do? Assume each of the steps below would take about an equal amount of time (a few days).

    1 / 1 point
    Check
    Correct

    As discussed in lecture, applied ML is a highly iterative process. If you train a basic model and carry out error analysis (see what mistakes it makes) it will help point you in more promising directions.

    2.
    Question 2

    Your goal is to detect road signs (stop sign, pedestrian crossing sign, construction ahead sign) and traffic signals (red and green lights) in images. The goal is to recognize which of these objects appear in each image. You plan to use a deep neural network with ReLU units in the hidden layers.

    For the output layer, a softmax activation would be a good choice for the output layer because this is a multi-task learning problem. True/False?

    1 / 1 point
    Check
    Correct

    Softmax would be a good choice if one and only one of the possibilities (stop sign, speed bump, pedestrian crossing, green light and red light) was present in each image.

    3.
    Question 3

    You are carrying out error analysis and counting up what errors the algorithm makes. Which of these datasets do you think you should manually go through and carefully examine, one image at a time?

    1 / 1 point
    Check
    Correct

    Focus on images that the algorithm got wrong. Also, 500 is enough to give you a good initial sense of the error statistics. There’s probably no need to look at 10,000, which will take a long time.

    4.
    Question 4

    After working on the data for several weeks, your team ends up with the following data:

    • 100,000 labeled images taken using the front-facing camera of your car.
    • 900,000 labeled images of roads downloaded from the internet.
    • Each image’s labels precisely indicate the presence of any specific road signs and traffic signals or combinations of them. For example, y(i)y^{(i)}y(i) = [10010]
      ⎡⎣⎢⎢⎢⎢10010⎤⎦⎥⎥⎥⎥
      ⎣⎢⎢⎢⎢⎢⎡​10010​⎦⎥⎥⎥⎥⎥⎤​
      means the image contains a stop sign and a red traffic light.

    Because this is a multi-task learning problem, you need to have all your y(i)y^{(i)}y(i) vectors fully labeled. If one example is equal to [0?11?]

    ⎡⎣⎢⎢⎢⎢0?11?⎤⎦⎥⎥⎥⎥
    ⎣⎢⎢⎢⎢⎢⎡​0?11?​⎦⎥⎥⎥⎥⎥⎤​ then the learning algorithm will not be able to use that example. True/False?

    1 / 1 point
    Check
    Correct

    As seen in the lecture on multi-task learning, you can compute the cost such that it is not influenced by the fact that some entries haven’t been labeled.

    5.
    Question 5

    The distribution of data you care about contains images from your car’s front-facing camera; which comes from a different distribution than the images you were able to find and download off the internet. How should you split the dataset into train/dev/test sets?

    1 / 1 point
    Check
    Correct

    Yes. As seen in lecture, it is important that your dev and test set have the closest possible distribution to “real”-data. It is also important for the training set to contain enough “real”-data to avoid having a data-mismatch problem.

    6.
    Question 6

    Assume you’ve finally chosen the following split between of the data:

    Dataset:Contains:Error of the algorithm:
    Training940,000 images randomly picked from (900,000 internet images + 60,000 car’s front-facing camera images)8.8%
    Training-Dev20,000 images randomly picked from (900,000 internet images + 60,000 car’s front-facing camera images)9.1%
    Dev20,000 images from your car’s front-facing camera14.3%
    Test20,000 images from the car’s front-facing camera14.8%

    You also know that human-level error on the road sign and traffic signals classification task is around 0.5%. Which of the following are True? (Check all that apply).

    1 / 1 point
    Check
    Correct
    Check
    Correct
    7.
    Question 7

    Based on the table from the previous question, a friend thinks that the training data distribution is much easier than the dev/test distribution. What do you think?

    1 / 1 point
    Check
    Correct

    The algorithm does better on the distribution of data it trained on. But you don’t know if it’s because it trained on that no distribution or if it really is easier. To get a better sense, measure human-level error separately on both distributions.

    8.
    Question 8

    You decide to focus on the dev set and check by hand what are the errors due to. Here is a table summarizing your discoveries:

    Overall dev set error15.3%
    Errors due to incorrectly labeled data4.1%
    Errors due to foggy pictures8.0%
    Errors due to rain drops stuck on your car’s front-facing camera2.2%
    Errors due to other causes1.0%

    In this table, 4.1%, 8.0%, etc. are a fraction of the total dev set (not just examples your algorithm mislabeled). For example, about 8.0/15.3 = 52% of your errors are due to foggy pictures.

    The results from this analysis implies that the team’s highest priority should be to bring more foggy pictures into the training set so as to address the 8.0% of errors in that category. True/False?

    Additional Note: there are subtle concepts to consider with this question, and you may find arguments for why some answers are also correct or incorrect. We recommend that you spend time reading the feedback for this quiz, to understand what issues that you will want to consider when you are building your own machine learning project.

    1 / 1 point
    Check
    Correct

    correct: feedback: This is the correct answer. You should consider the tradeoff between the data accessibility and potential improvement of your model trained on this additional data.

    9.
    Question 9

    You can buy a specially designed windshield wiper that help wipe off some of the raindrops on the front-facing camera. Based on the table from the previous question, which of the following statements do you agree with?

    1 / 1 point
    Check
    Correct

    Yes. You will probably not improve performance by more than 2.2% by solving the raindrops problem. If your dataset was infinitely big, 2.2% would be a perfect estimate of the improvement you can achieve by purchasing a specially designed windshield wiper that removes the raindrops.

    10.
    Question 10

    You decide to use data augmentation to address foggy images. You find 1,000 pictures of fog off the internet, and “add” them to clean images to synthesize foggy days, like this:

    Which of the following statements do you agree with?

    1 / 1 point
    Check
    Correct

    Yes. If the synthesized images look realistic, then the model will just see them as if you had added useful data to identify road signs and traffic signals in a foggy weather. I will very likely help.

    11.
    Question 11

    After working further on the problem, you’ve decided to correct the incorrectly labeled data on the dev set. Which of these statements do you agree with? (Check all that apply).

    1 / 1 point
    Check
    Correct

    True, deep learning algorithms are quite robust to having slightly different train and dev distributions.

    Check
    Correct

    Yes because you want to make sure that your dev and test data come from the same distribution for your algorithm to make your team’s iterative development process is efficient.

    12.
    Question 12

    So far your algorithm only recognizes red and green traffic lights. One of your colleagues in the startup is starting to work on recognizing a yellow traffic light. (Some countries call it an orange light rather than a yellow light; we’ll use the US convention of calling it yellow.) Images containing yellow lights are quite rare, and she doesn’t have enough data to build a good model. She hopes you can help her out using transfer learning.

    What do you tell your colleague?

    1 / 1 point
    Check
    Correct

    Yes. You have trained your model on a huge dataset, and she has a small dataset. Although your labels are different, the parameters of your model have been trained to recognize many characteristics of road and traffic images which will be useful for her problem. This is a perfect case for transfer learning, she can start with a model with the same architecture as yours, change what is after the last hidden layer and initialize it with your trained parameters.

    13.
    Question 13

    Another colleague wants to use microphones placed outside the car to better hear if there are other vehicles around you. For example, if there is a police vehicle behind you, you would be able to hear their siren. However, they don’t have much to train this audio system. How can you help?

    1 / 1 point
    Check
    Correct

    Yes. The problem he is trying to solve is quite different from yours. The different dataset structures make it probably impossible to use transfer learning or multi-task learning.

    14.
    Question 14

    To recognize red and green lights, you have been using this approach:

    • (A) Input an image (x) to a neural network and have it directly learn a mapping to make a prediction as to whether there’s a red light and/or green light (y).

    A teammate proposes a different, two-step approach:

    • (B) In this two-step approach, you would first (i) detect the traffic light in the image (if any), then (ii) determine the color of the illuminated lamp in the traffic light.

    Between these two, Approach B is more of an end-to-end approach because it has distinct steps for the input end and the output end. True/False?

    1 / 1 point
    Check
    Correct

    Yes. (A) is an end-to-end approach as it maps directly the input (x) to the output (y).

    15.
    Question 15

    Approach A (in the question above) tends to be more promising than approach B if you have a ________ (fill in the blank).

    1 / 1 point
    Check
    Correct

    Yes. In many fields, it has been observed that end-to-end learning works better in practice, but requires a large amount of data.